PhonePe Limited — Site Reliability Engineer – Database

Posted: 03-11-2025

Salary: ₹20–₹32 lakhs per annum (expected)

About the Company:

PhonePe, India’s largest digital payments platform, launched in 2016, serves more than 600 million registered users and 40 million merchants across the country. The platform processes over 330 million transactions daily with a total annualized payment value exceeding INR 150 lakh crore.

The company is expanding beyond payments into financial products such as insurance, lending, wealth management, and new ventures like Pincode (hyperlocal e-commerce) and Indus AppStore (a localized Android App Store). PhonePe’s mission is to make financial access seamless and equitable for every Indian.

About the Role:

The Site Reliability Engineer (Database) will safeguard the reliability, scalability, and performance of PhonePe’s high-volume, mission-critical database ecosystem. You will design and maintain multi-terabyte, distributed MySQL and Galera clusters across multiple data centers, and lead efforts to improve operations through automation, monitoring, and performance optimization.

This position is ideal for experienced database engineers who want to work on complex, high-availability systems and drive impactful improvements in large-scale infrastructure.

Key Responsibilities:

  • Design, provision, and manage large-scale MySQL and Galera multi-master clusters.
  • Implement database reliability strategies with robust disaster recovery and automation.
  • Optimize performance through tuning, indexing, query optimization, and system hardening.
  • Build automation for repetitive tasks like backups, schema updates, and replication.
  • Develop observability tooling that enables proactive monitoring and reduces alert noise.
  • Perform root cause analysis for incidents and coordinate recovery actions.
  • Lead capacity planning and database scaling initiatives.
  • Mentor junior engineers and promote knowledge sharing.
  • Collaborate closely with DevOps, Infrastructure, and Application Engineering teams on reliability and performance improvements.

Key Technical Skills:

MySQL Administration, Linux systems engineering, Bash/Python scripting, InnoDB engine, Replication, Database clustering, Query optimization, Performance tuning, Ansible, Terraform, Prometheus, Grafana, Percona Monitoring

Requirements:

  • Bachelor’s degree in Computer Science, Information Technology, or a related field.
  • 4–8 years of experience as an SRE or Database Administrator in large-scale, production-grade environments.
  • Expert knowledge of Linux system internals, file systems, storage management, and debugging tools.
  • Proven experience managing 100+ production MySQL clusters, each larger than 1 TB.
  • Experience with automation tools and scripting for infrastructure management.
  • Familiarity with monitoring tools (Prometheus, Grafana, Percona Monitoring).
  • Strong incident response, troubleshooting, and documentation abilities.
  • Effective communicator with demonstrated technical leadership and mentoring experience.

Important Notice:

This job description and related content are owned by PhonePe Limited. We are only sharing this information to help job seekers find opportunities. For application procedures, status, or any related concerns, please contact PhonePe Limited directly. We do not process applications or respond to candidate queries.